LOCUS ECOLAC 7477 bp DNA linear BCT 05-MAY-1993

DEFINITION E.coli lactose operon with lacI, lacZ, lacY and lacA genes.
ACCESSION J01636 J01637 K01483 K01793 
VERSION J01636.1 GI:146575

KEYWORDS acetyltransferase; beta-D-galactosidase; galactosidase; lac operon;
lac repressor protein; lacA gene; lacI gene; lacY gene; lacZ gene;
lactose permease; mutagenesis; palindrome; promoter region;
thiogalactoside acetyltransferase.
SOURCE Escherichia coli 

ORGANISM Escherichia coli 

Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales; 

Enterobacteriaceae; Escherichia.
REFERENCE 1 (bases 1243 to 1266)
AUTHORS Gilbert,W. and Maxam,A.
TITLE The nucleotide sequence of the lac operator
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 70 (12), 3581-3584 (1973)
MEDLINE 74055539
PUBMED 4587255
REFERENCE 2 (bases 1246 to 1308)
AUTHORS Maizels,N.M.
TITLE The nucleotide sequence of the lactose messenger ribonucleic acid
transcribed from the UV5 promoter mutant of Escherichia coli
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 70 (12), 3585-3589 (1973)
MEDLINE 74055540
PUBMED 4587256
REFERENCE 3 (sites)
AUTHORS Gilbert,W., Maizels,N. and Maxam,A.
TITLE Sequences of controlling regions of the lactose operon
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 38, 845-855 (1974)
MEDLINE 74174501
PUBMED 4598642
REFERENCE 4 (sites)
AUTHORS Gilbert,W., Gralla,J., Majors,A.J. and Maxam,A.
TITLE Lactose operator sequences and the action of lac repressor
JOURNAL (in) Sund,H. and Blauer,G. (Eds.);
PROTEIN-LIGAND INTERACTIONS: 193-207;
Walter de Gruyter, New York (1975)
REFERENCE 5 (bases 1146 to 1282)
AUTHORS Dickson,R.C., Abelson,J., Barnes,W.M. and Reznikoff,W.S.
TITLE Genetic regulation: the Lac control region
JOURNAL Science 187 (4171), 27-35 (1975)
MEDLINE 75048325
PUBMED 1088926
REFERENCE 6 (bases 1227 to 1271)
AUTHORS Gilbert,W., Maxam,A. and Mirzabekov,A.
TITLE Contacts between the lac repressor and DNA revealed by methylation
JOURNAL (in) Kjeldgaard,N.C. and Maaloe,O. (Eds.);
CONTROL OF RIBOSOME SYNTHESIS: 138-143;
Academic Press, New York (1976)
REFERENCE 7 (sites)
AUTHORS Marians,K.J. and Brooker,J.D.
TITLE Structure of the lactose operator
JOURNAL Nature 260 (5549), 360-363 (1976)
MEDLINE 76150089
PUBMED 768781

REFERENCE 8 (bases 1242 to 1268)
AUTHORS Heyneker,H.L., Shine,J., Goodman,H.M., Boyer,H.W., Rosenberg,J.,
Dickerson,R.E., Narang,S.A., Itakura,K., Lin,S. and Riggs,A.D.
TITLE Synthetic lac operator DNA is functional in vivo
JOURNAL Nature 263 (5580), 748-752 (1976)
MEDLINE 77056376
PUBMED 1069185
REFERENCE 9 (sites)
AUTHORS Dickson,R.C., Abelson,J. and Johnson,P.
TITLE Nucleotide sequence changes produced by mutations in the lac
promoter of Escherichia coli
JOURNAL J. Mol. Biol. 111 (1), 65-75 (1977)
MEDLINE 77168230
PUBMED 323498
REFERENCE 10 (bases 51 to 264)
AUTHORS Steege,D.A.
TITLE 5'-Terminal nucleotide sequence of Escherichia coli lactose
repressor mRNA: features of translational initiation and
reinitiation sites
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 74 (10), 4163-4167 (1977)
MEDLINE 78052881
PUBMED 337294



REFERENCE 11 (bases 1 to 81)
AUTHORS Calos,M.P.
TITLE DNA sequence for a low-level promoter of the lac repressor gene and
an 'up' promoter mutation
JOURNAL Nature 274 (5673), 762-765 (1978)
MEDLINE 78246990
PUBMED 355890
REFERENCE 12 (bases 49 to 1161)
AUTHORS Farabaugh,P.J.
TITLE Sequence of the lacI gene
JOURNAL Nature 274 (5673), 765-769 (1978)
MEDLINE 78246991
PUBMED 355891
REFERENCE 13 (sites)
AUTHORS Miller,J.H., Coulondre,C. and Farabaugh,P.J.
TITLE Correlation of nonsense sites in the lacI gene with specific codons
in the nucleotide sequence
JOURNAL Nature 274 (5673), 770-775 (1978)
MEDLINE 78246992
PUBMED 355892
REFERENCE 14 (sites)
AUTHORS Calos,M.P. and Miller,J.H.
TITLE DNA sequence alteration resulting from a mutation impairing
promoter function in the lac repressor gene
JOURNAL Mol. Gen. Genet. 178 (1), 225-227 (1980)
MEDLINE 80209248
PUBMED 6770231
REFERENCE 15 (bases 4306 to 5804)
AUTHORS Buchel,D.E., Gronenborn,B. and Muller-Hill,B.
TITLE Sequence of the lactose permease gene
JOURNAL Nature 283 (5747), 541-545 (1980)
MEDLINE 80120651
PUBMED 6444453
REFERENCE 16 (sites)
AUTHORS Miller,J.H., Calos,M.P. and Galas,D.J.
TITLE Genetic and sequencing studies of the specificity of transposition
into the lac region of E. coli
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 45 Pt 1, 243-257 (1981)
MEDLINE 82049502
PUBMED 6271472
REFERENCE 17 (sites)
AUTHORS Chenchick,A., Beabealashvilli,R. and Mirzabekov,A.
TITLE Topography of interaction of Escherichia coli RNA polymerase
subunits with lac UV5 promoter
JOURNAL FEBS Lett. 128 (1), 46-50 (1981)
MEDLINE 82004657
PUBMED 7023981
REFERENCE 18 (sites)
AUTHORS Betz,J.L. and Sadler,J.R.
TITLE Variants of a cloned synthetic lactose operator. I. A palindromic
dimer lactose operator derived from one strand of the cloned 40-base
pair operator
JOURNAL Gene 13 (1), 1-12 (1981)
MEDLINE 81213459
PUBMED 7016667
REFERENCE 19 (sites)
AUTHORS Sadler,J.R. and Tecklenburg,M.
TITLE Cloning and characterization of the natural lactose operator
JOURNAL Gene 13 (1), 13-23 (1981)
MEDLINE 81213463
PUBMED 6263752
REFERENCE 20 (sites)
AUTHORS Betz,J.L. and Sadler,J.R.
TITLE Variants of a cloned synthetic lactose operator. II.
Chloramphenicol-resistant revertants retaining a lactose operator
in the CAT gene of plasmid pBR325
JOURNAL Gene 15 (2-3), 187-200 (1981)
MEDLINE 82051311
PUBMED 6271642
REFERENCE 21 (sites)
AUTHORS Calos,M.P. and Miller,J.H.
TITLE The DNA sequence change resulting from the IQ1 mutation, which
greatly increases promoter strength
JOURNAL Mol. Gen. Genet. 183 (3), 559-560 (1981)
MEDLINE 82147746
PUBMED 7038381
REFERENCE 22 (sites)
AUTHORS Mieschendahl,M., Buchel,D., Bocklage,H. and Muller-Hill,B.
TITLE Mutations in the lacY gene of Escherichia coli define functional
organization of lactose permease
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78 (12), 7652-7656 (1981)
MEDLINE 82150928
PUBMED 6278484
REFERENCE 23 (sites)
AUTHORS Russell,D.R. and Bennett,G.N.
TITLE Construction and analysis of in vivo activity of E. coli promoter
hybrids and promoter mutants that alter the -35 to -10 spacing
JOURNAL Gene 20 (2), 231-243 (1982)
MEDLINE 83158761
PUBMED 6299890
REFERENCE 24 (sites)
AUTHORS Horowitz,H. and Platt,T.
TITLE A termination site for LacI transcription is between the CAP site
and the lac promoter
JOURNAL J. Biol. Chem. 257 (19), 11740-11746 (1982)
MEDLINE 83007251
PUBMED 6288696

REFERENCE 25 (sites)
AUTHORS Klein,R.D. and Wells,R.D.
TITLE Effects of neighboring DNA homopolymers on the biochemical and
physical properties of the Escherichia coli lactose promoter. I.
Cloning and characterization studies
JOURNAL J. Biol. Chem. 257 (21), 12954-12961 (1982)
MEDLINE 83030833
PUBMED 6290487
REFERENCE 26 (bases 1183 to 1291)
AUTHORS Weiher,H. and Schaller,H.
TITLE Segment-specific mutagenesis: extensive mutagenesis of a lac
promoter/operator element
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79 (5), 1408-1412 (1982)
MEDLINE 82174608
PUBMED 7041119

REFERENCE 27 (sites)
AUTHORS Van Dyke,M.W. and Dervan,P.B.
TITLE Footprinting with MPE.Fe(II). Complementary-strand analyses of
distamycin- and actinomycin-binding sites on heterogeneous DNA
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 47 Pt 1, 347-353 (1983)
MEDLINE 83233528
PUBMED 6305557
REFERENCE 28 (bases 1287 to 4364)
AUTHORS Kalnins,A., Otto,K., Ruther,U. and Muller-Hill,B.
TITLE Sequence of the lacZ gene of Escherichia coli
JOURNAL EMBO J. 2 (4), 593-597 (1983)
MEDLINE 84028567
PUBMED 6313347
REFERENCE 29 (sites)
AUTHORS Cone,K.C., Sellitti,M.A. and Steege,D.A.
TITLE Lac repressor mRNA transcription terminates in vivo in the lac
control region
JOURNAL J. Biol. Chem. 258 (18), 11296-11304 (1983)
MEDLINE 83291093
PUBMED 6309841
REFERENCE 30 (sites)
AUTHORS Sadler,J.R., Sasmor,H. and Betz,J.L.
TITLE A perfectly symmetric lac operator binds the lac repressor very
tightly
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80 (22), 6785-6789 (1983)
MEDLINE 84070714
PUBMED 6316325
REFERENCE 31 (sites)
AUTHORS Glickman,B.W. and Ripley,L.S.
TITLE Structural intermediates of deletion mutagenesis: a role for
palindromic DNA
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81 (2), 512-516 (1984)
MEDLINE 84119517
PUBMED 6582506

REFERENCE 32 (sites)
AUTHORS Spassky,A., Kirkegaard,K. and Buc,H.
TITLE Changes in the DNA structure of the lac UV5 promoter during
formation of an open complex with Escherichia coli RNA polymerase
JOURNAL Biochemistry 24 (11), 2723-2731 (1985)
MEDLINE 85280412
PUBMED 3896305
REFERENCE 33 (sites)
AUTHORS Straney,D.C. and Crothers,D.M.
TITLE Intermediates in transcription initiation from the E. coli lac UV5
promoter
JOURNAL Cell 43 (2 Pt 1), 449-459 (1985)
MEDLINE 86079527
PUBMED 2416465
REFERENCE 34 (sites)
AUTHORS Looman,A.C., de Gruyter,M., Vogelaar,A. and van Knippenberg,P.H.
TITLE Effects of heterologous ribosomal binding sites on the
transcription and translation of the lacZ gene of Escherichia coli
JOURNAL Gene 37 (1-3), 145-154 (1985)
MEDLINE 86031346
PUBMED 3932130
REFERENCE 35 (sites)
AUTHORS Mandecki,W., Goldman,R.A., Powell,B.S. and Caruthers,M.H.
TITLE lac Up-promoter mutants with increased homology to the consensus
promoter sequence
JOURNAL J. Bacteriol. 164 (3), 1353-1355 (1985)
MEDLINE 86059235
PUBMED 2999082
REFERENCE 36 (sites)
AUTHORS Malamy,M.H., Rahaim,P.T., Hoffman,C.S., Baghdoyan,D., O'Connor,M.B.
and Miller,J.F.
TITLE A frameshift mutation at the junction of an IS1 insertion within
lacZ restores beta-galactosidase activity via formation of an
active lacZ-IS1 fusion protein
JOURNAL J. Mol. Biol. 181 (4), 551-555 (1985)
MEDLINE 85210885
PUBMED 2987506
REFERENCE 37 (bases 5646 to 7477)
AUTHORS Hediger,M.A., Johnson,D.F., Nierlich,D.P. and Zabin,I.
TITLE DNA sequence of the lactose operon: the lacA gene and the
transcriptional termination region
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82 (19), 6414-6418 (1985)
MEDLINE 86016712
PUBMED 3901000
COMMENT Original source text: Escherichia coli DNA; mRNA; clone
lambda-h80dlac DNA; clone puk217; pgm8 (see comment).
[3] sites; UV5 mRNA transcripts and operator mutants. [(in)
Sund,H. and Blauer,G. (eds.); Protein-Ligand Interactions:
193-207;Walter de] sites; operator mutational analysis. [7]
sites; S1 and mung bean nuclease action on operator DNA. [9]
sites; class I, II and III promoter mutant analysis. [13] sites;
lacI mutant analysis.
[16] sites; Tn5, Tn9 and Tn10 insertion sites in lac region. [14]
sites; lacI promoter mutation UJ177.
[18] sites; palindromic dimer operator.
[19] sites; natural operator sequence.
[20] sites; operator mutational analysis.
[21] sites; lacI-Q deletion.
[17] sites; RNA polymerase UV5 promoter interaction. [22] sites;
lacY mutational analysis.
[24] sites; lacI-promoted transcription termination. [25] sites;
wt and UV5 promoter sequence studies. [23] sites; UV5 promoter
mutational analysis.
[30] sites; perfectly symmetric operator sequence. [29] sites;
lacI mRNA termination site.
[27] sites; distamycin and actinomycin binding to promoter. [31]
sites; lacI deletion studies.
[35] sites; promoter mutational studies.
[33] sites; DNAase I studies with promoter sequence. [34] sites;
ribosomal binding and translation initiation for lacZ. [36] sites;
insertion sequence IS1 integration in lacZ. [32] sites; DNAase I
studies with promoter.
[1] first reports a 27 bp operator (sites 1240-1266) with two-fold
symmetries; the operator has also been defined to be bases
1246-1266 or bases 1239-1273 [8]. [(in) Kjeldgaard,N.C. and Maaloe,
O. (eds); Control of ribosome synthesis: 138-143;A] explores the
ability of lac repressor protein to affect methylation of operator
DNA. [8] argues that DNA on both sides of the 21 bp operator (bases
1246-1266) affects repressor binding but that the sequences of this
DNA are probably not critical. [5] gives a larger sequence known as
the promoter-operator region for the wild-type, whereas [2] and
[26] give portions of this region for the mutant strain UV5. Within
the promoter region, bases 1162-1199 are identified as the
catabolite gene activator protein binding site (cap) and bases
1200-1245 are the RNA polymerase interaction site. [10] reports a
sequence for the 5' end of the lacI (repressor) gene and discusses
restart in mutant strains. [11] presents a sequence for the lacI
promoter region and identifies an I-Q mutation which enhances lacI
transcription approximately ten-fold. [12] gives a complete
sequence for lacI which agrees with the known lac repressor
sequence. [26] examines the promoter-operator region in the UV5
strain (lacl09) and studies 23 mutant derivatives of this sequence.
This sequence agrees with known protein sequences for the lacZ,
lacY and lacA enzymes. [15] notes that the fMet codon is not present
for lacA and suggests that the 'ttg' codon (5727-5729), which
immediately precedes the mature N-terminal asparagine codon, is the
start codon. The cds for lacZ, lacY and lacA are included on a
single mRNA transcript.
Complete source information:
Escherichia coli DNA [1],[(in) Kjeldgaard,N.C. and Maaloe,O. (eds);
Control of ribosome synthesis: 138-143;A],[8],[12],[26]; mRNA [2],
[5],[10]; clone lambda-h80dlac DNA [11],[15]; clone puk217 [28];
pgm8 [37].
FEATURES Location/Qualifiers
source 1..7477
/organism="Escherichia coli"
/mol_type="genomic DNA"
/db_xref="taxon:562"
variation 16
/note="c in wild-type; t in 'up' promoter mutant I-Q [11]"
mRNA 51..1230
/note="lacI (repressor) mRNA; preferred in vivo 3' end
[12],[29]"
gene 79..1161
/gene="lacI"
CDS 79..1161
/gene="lacI"
/note="lac repressor protein (gtg start codon)"
/codon_start=1
/transl_table=11
/protein_id="AAA24052.1"
/db_xref="GI:146576"
/translation="MKPVTLYDVAEYAGVSYQTVSRVVNQASHVSAKTREKVEAAMAE
LNYIPNRVAQQLAGKQSLLIGVATSSLALHAPSQIVAAIKSRADQLGASVVVSMVERS
GVEACKAAVHNLLAQRVSGLIINYPLDDQDAIAVEAACTNVPALFLDVSDQTPINSII
FSHEDGTRLGVEHLVALGHQQIALLAGPLSSVSARLRLAGWHKYLTRNQIQPIAEREG
DWSAMSGFQQTMQMLNEGIVPTAMLVANDQMALGAMRAITESGLRVGADISVVGYDDT
EDSSCYIPPSTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLVKRKTTLAPNT
QTASPRALADSLMQLARQVSRLESGQ"
misc_signal 1162..1199
/note="cap protein binding site"
variation 1183..1186
/note="ttag in wild-type; aatt in strain UV5 [26]"
variation 1209..1211
/note="gct in wild-type; gt in mutant 1305 [5]"
variation 1212
/note="t in wild-type; a in mutant 1241 [5]"
variation 1230
/note="c in wild-type; a in mutant p-r-la [5]"
variation 1237..1238
/note="gt in wild-type; aa in strain UV5 [26]"
variation 1242..1245
/note="gtgg in wild-type; ttca in synthetic operator [8]"
mRNA 1246..>4358
/note="lacZ mRNA [2],[5]"
misc_signal 1246..1266
/note="lac repressor protein binding site"
variation 1267..1268
/note="tc in wild-type; tg in synthetic operator [8]"
variation 1282..1291
/note="ctatgaccat in wild-type; gatccggcca in strain UV5
[26]"
gene 1284..4358
/gene="lacZ"
CDS 1284..4358
/gene="lacZ"
/note="beta-d-galactosidase"
/codon_start=1
/transl_table=11
/protein_id="AAA24053.1"
/db_xref="GI:146577"
/translation="MTMITDSLAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEAR
TDRPSQQLRSLNGEWRFAWFPAPEAVPESWLECDLPEADTVVVPSNWQMHGYDAPIYT
NVTYPITVNPPFVPTENPTGCYSLTFNVDESWLQEGQTRIIFDGVNSAFHLWCNGRWV
GYGQDSRLPSEFDLSAFLRAGENRLAVMVLRWSDGSYLEDQDMWRMSGIFRDVSLLHK
PTTQISDFHVATRFNDDFSRAVLEAEVQMCGELRDYLRVTVSLWQGETQVASGTAPFG
GEIIDERGGYADRVTLRLNVENPKLWSAEIPNLYRAVVELHTADGTLIEAEACDVGFR
EVRIENGLLLLNGKPLLIRGVNRHEHHPLHGQVMDEQTMVQDILLMKQNNFNAVRCSH
YPNHPLWYTLCDRYGLYVVDEANIETHGMVPMNRLTDDPRWLPAMSERVTRMVQRDRN
HPSVIIWSLGNESGHGANHDALYRWIKSVDPSRPVQYEGGGADTTATDIICPMYARVD
EDQPFPAVPKWSIKKWLSLPGETRPLILCEYAHAMGNSLGGFAKYWQAFRQYPRLQGG
FVWDWVDQSLIKYDENGNPWSAYGGDFGDTPNDRQFCMNGLVFADRTPHPALTEAKHQ
QQFFQFRLSGQTIEVTSEYLFRHSDNELLHWMVALDGKPLASGEVPLDVAPQGKQLIE
LPELPQPESAGQLWLTVRVVQPNATAWSEAGHISAWQQWRLAENLSVTLPAASHAIPH
LTTSEMDFCIELGNKRWQFNRQSGFLSQMWIGDKKQLLTPLRDQFTRAPLDNDIGVSE
ATRIDPNAWVERWKAAGHYQAEAALLQCTADTLADAVLITTAHAWQHQGKTLFISRKT
YRIDGSGQMAITVDVEVASDTPHPARIGLNCQLAQVAERVNWLGLGPQENYPDRLTAA
CFDRWDLPLSDMYTPYVFPSENGLRCGTRELNYGPHQWRGDFQFNISRYSQQQLMETS
HRHLLHAEEGTWLNIDGFHMGIGGDDSWSPSVSAEFQLSAGRYHYQLVWCQK"
gene 4410..5663
/gene="lacY"
CDS 4410..5663
/gene="lacY"
/note="lactose permease"
/codon_start=1
/transl_table=11
/protein_id="AAA24054.1"
/db_xref="GI:146578"
/translation="MYYLKNTNFWMFGLFFFFYFFIMGAYFPFFPIWLHDINHISKSD
TGIIFAAISLFSLLFQPLFGLLSDKLGLRKYLLWIITGMLVMFAPFFIFIFGPLLQYN
ILVGSIVGGIYLGFCFNAGAPAVEAFIEKVSRRSNFEFGRARMFGCVGWALCASIVGI
MFTINNQFVFWLGSGCALILAVLLFFAKTDAPSSATVANAVGANHSAFSLKLALELFR
QPKLWFLSLYVIGVSCTYDVFDQQFANFFTSFFATGEQGTRVFGYVTTMGELLNASIM
FFAPLIINRIGGKNALLLAGTIMSVRIIGSSFATSALEVVILKTLHMFEVPFLLVGCF
KYITSQFEVRFSATIYLVCFCFFKQLAMIFMSVLAGNMYESIGFQGAYLVLGLVALGF
TLISVFTLSGPGPLSLLRRQVNEVA"
gene 5727..6338
/gene="lacA"
CDS 5727..6338
/gene="lacA"
/note="thiogalactoside acetyltransferase (ttg start
codon)"
/codon_start=1
/transl_table=11
/protein_id="AAA24055.1"
/db_xref="GI:551814"
/translation="MNMPMTERIRAGKLFTDMCEGLPEKRLRGKTLMYEFNHSHPSEV
EKRESLIKEMFATVGENAWVEPPVYFSYGSNIHIGRNFYANFNLTIVDDYTVTIGDNV
LIAPNVTLSVTGHPVHHELRKNGEMYSFPITIGNNVWIGSHVVINPGVTIGDNSVIGA
GSIVTKDIPPNVVAAGVPCRVIREINDRDKHYYFKDYKVESSV"
ORIGIN HindII site [Nature 274, 762-765 (1978)].

1 gacaccatcg aatggcgcaa aacctttcgc ggtatggcat gatagcgccc ggaagagagt 
61 caattcaggg tggtgaatgt gaaaccagta acgttatacg atgtcgcaga gtatgccggt
121 gtctcttatc agaccgtttc ccgcgtggtg aaccaggcca gccacgtttc tgcgaaaacg
181 cgggaaaaag tggaagcggc gatggcggag ctgaattaca ttcccaaccg cgtggcacaa
241 caactggcgg gcaaacagtc gttgctgatt ggcgttgcca cctccagtct ggccctgcac
301 gcgccgtcgc aaattgtcgc ggcgattaaa tctcgcgccg atcaactggg tgccagcgtg
361 gtggtgtcga tggtagaacg aagcggcgtc gaagcctgta aagcggcggt gcacaatctt
421 ctcgcgcaac gcgtcagtgg gctgatcatt aactatccgc tggatgacca ggatgccatt
481 gctgtggaag ctgcctgcac taatgttccg gcgttatttc ttgatgtctc tgaccagaca
541 cccatcaaca gtattatttt ctcccatgaa gacggtacgc gactgggcgt ggagcatctg
601 gtcgcattgg gtcaccagca aatcgcgctg ttagcgggcc cattaagttc tgtctcggcg
661 cgtctgcgtc tggctggctg gcataaatat ctcactcgca atcaaattca gccgatagcg
721 gaacgggaag gcgactggag tgccatgtcc ggttttcaac aaaccatgca aatgctgaat
781 gagggcatcg ttcccactgc gatgctggtt gccaacgatc agatggcgct gggcgcaatg
841 cgcgccatta ccgagtccgg gctgcgcgtt ggtgcggata tctcggtagt gggatacgac
901 gataccgaag acagctcatg ttatatcccg ccgtcaacca ccatcaaaca ggattttcgc
961 ctgctggggc aaaccagcgt ggaccgcttg ctgcaactct ctcagggcca ggcggtgaag
1021 ggcaatcagc tgttgcccgt ctcactggtg aaaagaaaaa ccaccctggc gcccaatacg
1081 caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc
1141 cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc
1201 accccaggct ttacacttta tgcttccggc tcgtatgttg tgtggaattg tgagcggata
1261 acaatttcac acaggaaaca gctatgacca tgattacgga ttcactggcc gtcgttttac
1321 aacgtcgtga ctgggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc
1381 ctttcgccag ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc
1441 gcagcctgaa tggcgaatgg cgctttgcct ggtttccggc accagaagcg gtgccggaaa
1501 gctggctgga gtgcgatctt cctgaggccg atactgtcgt cgtcccctca aactggcaga
1561 tgcacggtta cgatgcgccc atctacacca acgtaaccta tcccattacg gtcaatccgc
1621 cgtttgttcc cacggagaat ccgacgggtt gttactcgct cacatttaat gttgatgaaa
1681 gctggctaca ggaaggccag acgcgaatta tttttgatgg cgttaactcg gcgtttcatc
1741 tgtggtgcaa cgggcgctgg gtcggttacg gccaggacag tcgtttgccg tctgaatttg
1801 acctgagcgc atttttacgc gccggagaaa accgcctcgc ggtgatggtg ctgcgttgga
1861 gtgacggcag ttatctggaa gatcaggata tgtggcggat gagcggcatt ttccgtgacg
1921 tctcgttgct gcataaaccg actacacaaa tcagcgattt ccatgttgcc actcgcttta
1981 atgatgattt cagccgcgct gtactggagg ctgaagttca gatgtgcggc gagttgcgtg
2041 actacctacg ggtaacagtt tctttatggc agggtgaaac gcaggtcgcc agcggcaccg
2101 cgcctttcgg cggtgaaatt atcgatgagc gtggtggtta tgccgatcgc gtcacactac
2161 gtctgaacgt cgaaaacccg aaactgtgga gcgccgaaat cccgaatctc tatcgtgcgg
2221 tggttgaact gcacaccgcc gacggcacgc tgattgaagc agaagcctgc gatgtcggtt
2281 tccgcgaggt gcggattgaa aatggtctgc tgctgctgaa cggcaagccg ttgctgattc
2341 gaggcgttaa ccgtcacgag catcatcctc tgcatggtca ggtcatggat gagcagacga
2401 tggtgcagga tatcctgctg atgaagcaga acaactttaa cgccgtgcgc tgttcgcatt
2461 atccgaacca tccgctgtgg tacacgctgt gcgaccgcta cggcctgtat gtggtggatg
2521 aagccaatat tgaaacccac ggcatggtgc caatgaatcg tctgaccgat gatccgcgct
2581 ggctaccggc gatgagcgaa cgcgtaacgc gaatggtgca gcgcgatcgt aatcacccga
2641 gtgtgatcat ctggtcgctg gggaatgaat caggccacgg cgctaatcac gacgcgctgt
2701 atcgctggat caaatctgtc gatccttccc gcccggtgca gtatgaaggc ggcggagccg
2761 acaccacggc caccgatatt atttgcccga tgtacgcgcg cgtggatgaa gaccagccct
2821 tcccggctgt gccgaaatgg tccatcaaaa aatggctttc gctacctgga gagacgcgcc
2881 cgctgatcct ttgcgaatac gcccacgcga tgggtaacag tcttggcggt ttcgctaaat
2941 actggcaggc gtttcgtcag tatccccgtt tacagggcgg cttcgtctgg gactgggtgg
3001 atcagtcgct gattaaatat gatgaaaacg gcaacccgtg gtcggcttac ggcggtgatt
3061 ttggcgatac gccgaacgat cgccagttct gtatgaacgg tctggtcttt gccgaccgca
3121 cgccgcatcc agcgctgacg gaagcaaaac accagcagca gtttttccag ttccgtttat
3181 ccgggcaaac catcgaagtg accagcgaat acctgttccg tcatagcgat aacgagctcc
3241 tgcactggat ggtggcgctg gatggtaagc cgctggcaag cggtgaagtg cctctggatg
3301 tcgctccaca aggtaaacag ttgattgaac tgcctgaact accgcagccg gagagcgccg
3361 ggcaactctg gctcacagta cgcgtagtgc aaccgaacgc gaccgcatgg tcagaagccg
3421 ggcacatcag cgcctggcag cagtggcgtc tggcggaaaa cctcagtgtg acgctccccg
3481 ccgcgtccca cgccatcccg catctgacca ccagcgaaat ggatttttgc atcgagctgg
3541 gtaataagcg ttggcaattt aaccgccagt caggctttct ttcacagatg tggattggcg
3601 ataaaaaaca actgctgacg ccgctgcgcg atcagttcac ccgtgcaccg ctggataacg
3661 acattggcgt aagtgaagcg acccgcattg accctaacgc ctgggtcgaa cgctggaagg
3721 cggcgggcca ttaccaggcc gaagcagcgt tgttgcagtg cacggcagat acacttgctg
3781 atgcggtgct gattacgacc gctcacgcgt ggcagcatca ggggaaaacc ttatttatca
3841 gccggaaaac ctaccggatt gatggtagtg gtcaaatggc gattaccgtt gatgttgaag
3901 tggcgagcga tacaccgcat ccggcgcgga ttggcctgaa ctgccagctg gcgcaggtag
3961 cagagcgggt aaactggctc ggattagggc cgcaagaaaa ctatcccgac cgccttactg
4021 ccgcctgttt tgaccgctgg gatctgccat tgtcagacat gtataccccg tacgtcttcc
4081 cgagcgaaaa cggtctgcgc tgcgggacgc gcgaattgaa ttatggccca caccagtggc
4141 gcggcgactt ccagttcaac atcagccgct acagtcaaca gcaactgatg gaaaccagcc
4201 atcgccatct gctgcacgcg gaagaaggca catggctgaa tatcgacggt ttccatatgg
4261 ggattggtgg cgacgactcc tggagcccgt cagtatcggc ggaattccag ctgagcgccg
4321 gtcgctacca ttaccagttg gtctggtgtc aaaaataata ataaccgggc aggccatgtc
4381 tgcccgtatt tcgcgtaagg aaatccatta tgtactattt aaaaaacaca aacttttgga
4441 tgttcggttt attctttttc ttttactttt ttatcatggg agcctacttc ccgtttttcc
4501 cgatttggct acatgacatc aaccatatca gcaaaagtga tacgggtatt atttttgccg
4561 ctatttctct gttctcgcta ttattccaac cgctgtttgg tctgctttct gacaaactcg
4621 ggctgcgcaa atacctgctg tggattatta ccggcatgtt agtgatgttt gcgccgttct
4681 ttatttttat cttcgggcca ctgttacaat acaacatttt agtaggatcg attgttggtg
4741 gtatttatct aggcttttgt tttaacgccg gtgcgccagc agtagaggca tttattgaga
4801 aagtcagccg tcgcagtaat ttcgaatttg gtcgcgcgcg gatgtttggc tgtgttggct
4861 gggcgctgtg tgcctcgatt gtcggcatca tgttcaccat caataatcag tttgttttct
4921 ggctgggctc tggctgtgca ctcatcctcg ccgttttact ctttttcgcc aaaacggatg
4981 cgccctcttc tgccacggtt gccaatgcgg taggtgccaa ccattcggca tttagcctta
5041 agctggcact ggaactgttc agacagccaa aactgtggtt tttgtcactg tatgttattg
5101 gcgtttcctg cacctacgat gtttttgacc aacagtttgc taatttcttt acttcgttct
5161 ttgctaccgg tgaacagggt acgcgggtat ttggctacgt aacgacaatg ggcgaattac
5221 ttaacgcctc gattatgttc tttgcgccac tgatcattaa tcgcatcggt gggaaaaacg
5281 ccctgctgct ggctggcact attatgtctg tacgtattat tggctcatcg ttcgccacct
5341 cagcgctgga agtggttatt ctgaaaacgc tgcatatgtt tgaagtaccg ttcctgctgg
5401 tgggctgctt taaatatatt accagccagt ttgaagtgcg tttttcagcg acgatttatc
5461 tggtctgttt ctgcttcttt aagcaactgg cgatgatttt tatgtctgta ctggcgggca
5521 atatgtatga aagcatcggt ttccagggcg cttatctggt gctgggtctg gtggcgctgg
5581 gcttcacctt aatttccgtg ttcacgctta gcggccccgg cccgctttcc ctgctgcgtc
5641 gtcaggtgaa tgaagtcgct taagcaatca atgtcggatg cggcgcgacg cttatccgac
5701 caacatatca taacggagtg atcgcattga acatgccaat gaccgaaaga ataagagcag
5761 gcaagctatt taccgatatg tgcgaaggct taccggaaaa aagacttcgt gggaaaacgt
5821 taatgtatga gtttaatcac tcgcatccat cagaagttga aaaaagagaa agcctgatta
5881 aagaaatgtt tgccacggta ggggaaaacg cctgggtaga accgcctgtc tatttctctt
5941 acggttccaa catccatata ggccgcaatt tttatgcaaa tttcaattta accattgtcg
6001 atgactacac ggtaacaatc ggtgataacg tactgattgc acccaacgtt actctttccg
6061 ttacgggaca ccctgtacac catgaattga gaaaaaacgg cgagatgtac tcttttccga
6121 taacgattgg caataacgtc tggatcggaa gtcatgtggt tattaatcca ggcgtcacca
6181 tcggggataa ttctgttatt ggcgcgggta gtatcgtcac aaaagacatt ccaccaaacg
6241 tcgtggcggc tggcgttcct tgtcgggtta ttcgcgaaat aaacgaccgg gataagcact
6301 attatttcaa agattataaa gttgaatcgt cagtttaaat tataaaaatt gcctgatacg
6361 ctgcgcttat caggcctaca agttcagcga tctacattag ccgcatccgg catgaacaaa
6421 gcgcaggaac aagcgtcgca tcatgcctct ttgacccaca gctgcggaaa acgtactggt
6481 gcaaaacgca gggttatgat catcagccca acgacgcaca gcgcatgaaa tgcccagtcc
6541 atcaggtaat tgccgctgat actacgcagc acgccagaaa accacggggc aagcccggcg
6601 atgataaaac cgattccctg cataaacgcc accagcttgc cagcaatagc cggttgcaca
6661 gagtgatcga gcgccagcag caaacagagc ggaaacgcgc cgcccagacc taacccacac
6721 accatcgccc acaataccgg caattgcatc ggcagccaga taaagccgca gaaccccacc
6781 agttgtaaca ccagcgccag cattaacagt ttgcgccgat cctgatggcg agccatagca
6841 ggcatcagca aagctcctgc ggcttgccca agcgtcatca atgccagtaa ggaaccgctg
6901 tactgcgcgc tggcaccaat ctcaatatag aaagcgggta accaggcaat caggctggcg
6961 taaccgccgt taatcagacc gaagtaaaca cccagcgtcc acgcgcgggg agtgaatacc
7021 acgcgaaccg gagtggttgt tgtcttgtgg gaagaggcga cctcgcgggc gctttgccac
7081 caccaggcaa agagcgcaac aacggcaggc agcgccacca ggcgagtgtt tgataccagg
7141 tttcgctatg ttgaactaac cagggcgtta tggcggcacc aagcccaccg ccgcccatca
7201 gagccgcgga ccacagcccc atcaccagtg gcgtgcgctg ctgaaaccgc cgtttaatca
7261 ccgaagcatc accgcctgaa tgatgccgat ccccacccca ccaagcagtg cgctgctaag
7321 cagcagcgca ctttgcgggt aaagctcacg catcaatgca ccgacggcaa tcagcaacag
7381 actgatggcg acactgcgac gttcgctgac atgctgatga agccagcttc cggccagcgc
7441 cagcccgccc atggtaacca ccggcagagc ggtcgac
//
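
The CDS qualifiers in this record pair /transl_table=11 with non-ATG initiator codons (gtg for lacI, ttg for lacA). As a minimal sketch of how such a feature location maps onto the ORIGIN sequence, the Python below translates bases 79..120 of lacI from the first 120 bases of the record. The helper name `translate_cds` and the hand-built codon table are illustrative assumptions, not part of the record or of any particular library; the only table-11 behaviour modelled here is reading a permitted start codon as Met.

```python
from itertools import product

# Standard codon assignments in NCBI order (t, c, a, g). Bacterial
# table 11 shares these assignments and differs in its start codons.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiator codons permitted by translation table 11.
STARTS = {"ttg", "ctg", "att", "atc", "ata", "atg", "gtg"}


def translate_cds(seq, start, end):
    """Translate 1-based inclusive coordinates start..end (hypothetical helper)."""
    cds = seq[start - 1:end].lower()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in STARTS:
            aa = "M"  # a permitted initiator codon is read as Met
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# Bases 1..120 of the ORIGIN sequence (rows 1 and 61), spaces removed.
ORIGIN_1_120 = (
    "gacaccatcgaatggcgcaaaacctttcgcggtatggcatgatagcgcccggaagagagt"
    "caattcagggtggtgaatgtgaaaccagtaacgttatacgatgtcgcagagtatgccggt"
)

lacI_prefix = translate_cds(ORIGIN_1_120, 79, 120)
print(lacI_prefix)  # MKPVTLYDVAEYAG
```

The printed residues agree with the start of the /translation qualifier given for lacI, which is a quick way to confirm that a feature location and the sequence rows are in register.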






